Siamese Regression Networks with Efficient mid-level Feature Extraction for 3D Object Pose Estimation
Authors
Abstract
In this paper we tackle the problem of estimating the 3D pose of object instances using convolutional neural networks. State-of-the-art methods usually solve the challenging problem of regression in angle space indirectly, focusing on learning discriminative features that are later fed into a separate architecture for 3D pose estimation. In contrast, we propose an end-to-end learning framework that directly regresses object poses by exploiting Siamese networks. For a given image pair, we enforce a similarity measure between the representations of the sample images in feature space and in pose space, which is shown to boost regression performance. Furthermore, we argue that pose-guided feature learning with our Siamese Regression Network generates more discriminative features that outperform the state of the art. Finally, our feature learning formulation makes it possible to learn features that remain effective under severe occlusion, demonstrating high performance on our novel hand-object dataset.
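The pairwise objective described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' formulation: the abstract does not give the exact loss, so the function name `siamese_regression_loss`, the `weight` parameter, the use of Euclidean distances, and the treatment of a pose as a plain vector (ignoring angle wrap-around) are all assumptions made for the example.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def siamese_regression_loss(feat_a, feat_b, pred_a, pred_b,
                            pose_a, pose_b, weight=1.0):
    """Hypothetical pairwise loss for one image pair (a, b).

    feat_*: feature vectors produced by the shared network branch
    pred_*: poses regressed by the network
    pose_*: ground-truth poses
    weight: assumed trade-off between the two terms
    """
    # Direct regression term: each predicted pose should match its label.
    regression = sq_dist(pred_a, pose_a) + sq_dist(pred_b, pose_b)

    # Similarity term (assumed form): the distance between the two
    # images in feature space should mirror their distance in pose space.
    feat_gap = math.sqrt(sq_dist(feat_a, feat_b))
    pose_gap = math.sqrt(sq_dist(pose_a, pose_b))
    similarity = (feat_gap - pose_gap) ** 2

    return regression + weight * similarity


# Perfect predictions, but identical features for two different poses:
# only the similarity term contributes.
loss = siamese_regression_loss(
    feat_a=[0.0, 0.0], feat_b=[0.0, 0.0],
    pred_a=[0.0, 0.0, 0.0], pred_b=[5.0, 0.0, 0.0],
    pose_a=[0.0, 0.0, 0.0], pose_b=[5.0, 0.0, 0.0],
)
```

In this sketch, the similarity term is what ties feature learning to pose: two images with very different poses are pushed apart in feature space even when the regression output is already correct.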
Similar resources
Human Pose Estimation with Regression by Fusing Multi-View Visual Information
We consider the problem of estimating 3D human body pose from visual signals within a discriminative framework. It is challenging because there is a wide gap between complex 3D human motion and planar visual observation, which makes this a severely ill-conditioned problem. In this paper, we focus on three critical factors to tackle human body pose estimation, namely, feature extraction, learni...
3D Pose Estimation of the Face from Video
Face pose information is valuable for a variety of applications including unconstrained face recognition, natural human computer interfaces, and video database indexing. 3D pose estimation is a critical requirement for accurate face recognition using view varying representations, such as 2D intensity images. 3D pose extraction in this context requires 3D information, which is present in image s...
An Iterative Regression Approach for Face Pose Estimation from RGB Images
Wenye He. This paper presents an iterative optimization method, explicit shape regression, for face pose detection and localization. The regression function is learnt to find the entire facial shape and minimize the alignment errors. A cascaded learning framework is employed to enhance the shape constraint during detection. A combination of a two-level boosted regression, shape indexed features a...
From Depth Data to Head Pose Estimation: A Siamese Approach
The correct estimation of the head pose is a problem of great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in a regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose...
Camera Pose Estimation in Unknown Environments using a Sequence of Wide-Baseline Monocular Images
In this paper, a feature-based technique for camera pose estimation in a sequence of wide-baseline images is proposed. Camera pose estimation is an important issue in many computer vision and robotics applications, such as augmented reality and visual SLAM. The proposed method can track images captured by a hand-held camera in room-sized workspaces with maximum scene depth of 3-4...
Journal: CoRR
Volume: abs/1607.02257
Publication year: 2016